SVM-based miRNA:miRNA* duplex prediction
Abstract
We address the problem of predicting the miRNA:miRNA* duplex stemming from a microRNA (miRNA) hairpin precursor and present an SVM-based methodology for it. Predicting the miRNA:miRNA* duplex is a first step towards identifying the mature miRNA, suggesting possible miRNA targets and, ultimately, reducing experimentation effort, time, and cost. We measure the error as the absolute difference between the true and predicted location, either over all four ends of the duplex or for each end separately. Our mean absolute error over all ends is 1.61 ± 2.24 nts, as measured on a hold-out set of 220 miRNA hairpin precursor sequences. In addition, our tool precisely predicts (with 0 nt deviation) the starting position for 57% and 52% of the miRNAs in the 5’ and 3’ strands of the same dataset, significantly outperforming the state-of-the-art tool MaturePred, which achieves 18% and 12%, respectively, on the same task. Overall, our method accurately identifies not only the starting nucleotide of novel miRNA:miRNA* duplexes (and thus of individual miRNAs) but also their length.
Keywords: miRNA:miRNA*; duplex; microRNA; SVM; Dicer.
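The error measure described in the abstract can be illustrated with a minimal sketch. It is not the authors' evaluation code: the tuple layout (5' strand start, 5' strand end, 3' strand start, 3' strand end), the names END_NAMES and summarize, and the toy coordinates are all assumptions made for illustration, and the abstract does not say whether the reported ± value is a population or a sample standard deviation (the population form is used here).

```python
from statistics import mean, pstdev

# Hypothetical layout: one duplex = positions of its four ends on the hairpin.
END_NAMES = ("5p_start", "5p_end", "3p_start", "3p_end")


def summarize(true_duplexes, pred_duplexes):
    """Absolute position error per end, pooled and broken down by end."""
    per_end = {name: [] for name in END_NAMES}
    for true_ends, pred_ends in zip(true_duplexes, pred_duplexes):
        for name, t, p in zip(END_NAMES, true_ends, pred_ends):
            per_end[name].append(abs(t - p))  # error in nucleotides

    all_errors = [e for errs in per_end.values() for e in errs]
    return {
        # Mean absolute error over all four ends of all duplexes.
        "mae_all": mean(all_errors),
        # Spread of the pooled errors (population SD: an assumption).
        "sd_all": pstdev(all_errors),
        # Mean absolute error for each end separately.
        "mae_per_end": {n: mean(e) for n, e in per_end.items()},
        # Fraction of duplexes predicted with 0 nt deviation at each end.
        "exact_rate": {
            n: sum(1 for x in e if x == 0) / len(e) for n, e in per_end.items()
        },
    }


# Toy coordinates, invented purely to exercise the functions.
true_duplexes = [(10, 31, 50, 71), (5, 26, 44, 65)]
pred_duplexes = [(10, 30, 52, 71), (6, 26, 44, 62)]
report = summarize(true_duplexes, pred_duplexes)
print(report["mae_all"], report["exact_rate"]["5p_start"])
```

In this layout, the "exact_rate" entries for the two start positions correspond to the kind of 0 nt deviation percentages quoted above for the 5’ and 3’ strands.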
Similar resources
Learning to Predict miRNA-mRNA Interactions from AGO CLIP Sequencing and CLASH Data
Recent technologies like AGO CLIP sequencing and CLASH enable direct transcriptome-wide identification of AGO binding and miRNA target sites, but the most widely used miRNA target prediction algorithms do not exploit these data. Here we use discriminative learning on AGO CLIP and CLASH interactions to train a novel miRNA target prediction model. Our method combines two SVM classifiers, one to p...
New support vector machine-based method for microRNA target prediction.
MicroRNA (miRNA) plays important roles in cell differentiation, proliferation, growth, mobility, and apoptosis. An accurate list of precise target genes is necessary in order to fully understand the importance of miRNAs in animal development and disease. Several computational methods have been proposed for miRNA target-gene identification. However, these methods still have limitations with resp...
MiRduplexSVM: A High-Performing MiRNA-Duplex Prediction and Evaluation Methodology
We address the problem of predicting the position of a miRNA duplex on a microRNA hairpin via the development and application of a novel SVM-based methodology. Our method combines a unique problem representation and an unbiased optimization protocol to learn from miRBase 19.0 an accurate predictive model, termed MiRduplexSVM. This is the first model that provides precise information about all fo...
Improving classification of mature microRNA by solving class imbalance problem
MicroRNAs (miRNAs) are non-coding RNAs of ~20-25 nucleotides that regulate gene expression at the post-transcriptional level. The accuracy of identifying the start site of the mature miRNA from a given pre-miRNA remains low. It is worth noting that mature miRNA prediction is a class-imbalanced problem, which also contributes to the unsatisfactory performance of these methods. We improved the predictio...
Plant microRNA-Target Interaction Identification Model Based on the Integration of Prediction Tools and Support Vector Machine
BACKGROUND: Confident identification of microRNA-target interactions is significant for studying the function of microRNA (miRNA). Although some computational miRNA target prediction methods have been proposed for plants, the results of the various methods tend to be inconsistent and usually lead to more false positives. To address these issues, we developed an integrated model for identifying plant miRN...
Publication date: 2012